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Abstract 

We study the use of linear codes for network computing in single-receiver networks with 
various classes of target functions of the source messages. Such classes include reducible, 
injective, semi-injective, and linear target functions over finite fields. Computing capacity 
bounds and achievability are given with respect to these target function classes for network 
codes that use routing, linear coding, or nonlinear coding. 
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1 Introduction 

Network coding concerns networks where each receiver demands a subset of messages generated 
by the source nodes and the objective is to satisfy the receiver demands at the maximum possible 
throughput rate. Accordingly, research efforts have studied coding gains over routing ifTl fTOlfTTTl . 
whether linear codes are sufficient to achieve the capacity 0|71[T4l[T6]|, and cut- set upper bounds 
on the capacity and the tightness of such bounds [fT0l[TTll23l . 

Network computing, on the other hand, considers a more general problem in which each re- 
ceiver node demands a target function of the source messages flH[H[T5l[T71|2lll22l- Most problems 
in network coding are applicable to network computing as well. Network computing problems 
arise in various networks including sensor networks and vehicular networks. 

In Hi, a network computing model was proposed where the network is modeled by a directed, 
acyclic graph with independent, noiseless links. The sources generate independent messages and a 
single receiver node computes a target function / of these messages. The objective is to character- 
ize the maximum rate of computation, that is, the maximum number of times / can be computed 
per network usage. Each node in the network sends out symbols on its out-edges which are arbi- 
trary, but fixed, functions of the symbols received on its in-edges and any messages generated at 
the node. In linear network computing, this encoding is restricted to be linear operations. Existing 
techniques for computing in networks use routing, where the codeword sent out by a node consists 
of symbols either received by that node, or generated by the node if it is a source (e.g. [fT9l ). 

In network coding, it is known that linear codes are sufficient to achieve the coding capacity 
for multicast networks 0], but they are not sufficient in general to achieve the coding capacity 
for non-multicast networks [6). In network computing, it is known that when multiple receiver 
nodes demand a scalar linear target function of the source messages, linear network codes may not 
be sufficient in general for solvability 11201 . However, it has been shown that for single-receiver 
networks, linear coding is sufficient for solvability when computing a scalar linear target function 
PI 1211. Analogous to the coding capacity for network coding, the notion of computing capacity 
was defined for network computing in [[81 and is the supremum of achievable rates of computing 
the network's target function. 

One fundamental objective in the present paper is to understand the performance of linear net- 
work codes for computing different types of target functions. Specifically, we compare the linear 
computing capacity with that of the (nonlinear) computing capacity and the routing computing 
capacity for various different classes of target functions in single-receiver networks. Such classes 
include reducible, injective, semi-injective, and linear target functions over finite fields. Infor- 
mally, a target function is semi-injective if it uniquely maps at least one of its inputs, and a target 
function is reducible if it can be computed using a linear transformation followed by a function 
whose domain has a reduced dimension. Computing capacity bounds and achievability are given 
with respect to the target function classes studied for network codes that use routing, linear coding, 
or nonlinear coding. 

Our specific contributions will be summarized next. 
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1.1 Contributions 

Section [2] gives many of the formal definitions used in the paper (e.g. target function classes and 
computing capacity types). We show that routing messages through the intermediate nodes in a 
network forces the receiver to obtain all the messages even though only a function of the messages 
is required (Theorem 12 .101 ), and we bound the computing capacity gain of using nonlinear versus 
routing codes (Theorem 12. 121 ). 

In Section [3j we demonstrate that the performance of optimal linear codes may depend on how 
'linearity' is defined (Theorem 13 .21) . Specifically, we show that the linear computing capacity of a 
network varies depending on which ring linearity is defined over on the source alphabet. 

In Sections H] and |5l we study the computing capacity gain of using linear coding over routing, 
and nonlinear coding over linear coding. In particular, we study various classes of target functions, 
including injective, semi-injective, reducible, and linear. The relationships between these classes 
is illustrated in Figured] 

Section @] studies linear coding for network computing. We show that if a target function 
is not reducible, then the linear computing capacity and routing computing capacity are equal 
whenever the source alphabet is a finite field (Theorem 14.81 ); the same result also holds for semi- 
injective target functions over rings. We also show that whenever a target function is injective, 
routing obtains the full computing capacity of a network (Theorem 14.91 ), although whenever a 
target function is neither reducible nor injective, there exists a network such that the computing 
capacity is larger than the linear computing capacity (Theorem 14.1 II) . Thus for non-injective target 
functions that are not reducible, any computing capacity gain of using coding over routing must 
be obtained through nonlinear coding. This result is tight in the sense that if a target function is 
reducible, then there always exists a network where the linear computing capacity is larger than 
the routing capacity (Theorem l4.12l) . We also show that there exists a reducible target function and 
a network whose computing capacity is strictly greater than its linear computing capacity, which 
in turn is strictly greater than its routing computing capacity. (Theorem 14. 141 ). 

Section [5] focuses on computing linear target functions over finite fields. We characterize the 
linear computing capacity for linear target functions over finite fields in arbitrary networks (Theo- 
rem [5]6]). We show that linear codes are sufficient for linear target functions and we upper bound 
the computing capacity gain of coding (linear or nonlinear) over routing (Theorem 15.71) . This up- 
per bound is shown to be achievable for every linear target function and an associated network, in 
which case the computing capacity is equal to the routing computing capacity times the number of 
network sources (Theorem 15. 81 ). 

Finally, Section|6]studies an illustrative example for the computing problem, namely the reverse 
butterfly network - obtained by reversing the direction of all the edges in the multicast butterfly 
network (the butterfly network studied in [1J illustrated the capacity gain of network coding over 
routing). For this network and the arithmetic sum target function, we evaluate the routing and 
linear computing capacity (Theorem 16.11 ) and the computing capacity (Theorem 16.3b . We show 
that the latter is strictly larger than the first two, which are equal to each other. No network with 
such properties is presently known for network coding. Among other things, the reverse butterfly 
network also illustrates that the computing capacity can be a function of the coding alphabet (i.e. 
the domain of the target function /). In contrast, for network coding, the coding capacity and 
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routing capacity are known to be independent of the coding alphabet used 0. 
Our main results are summarized in Table [TJ 



All target functions 



Reducible 




Figure 1: Decomposition of the space of all target functions into various classes. 



Result 


/ 


A 


Location 


V/VAAC lin (AT,/) = C roat {Af,f) 


non-reducible 


field 


Theorem US] 


semi-injective 


ring 


V/VyVC cod (/V,.f) =C aat (Af,f) 


injective 




Theoreml4.9l 


V/3AAC cod (AA,/) >C M {M,f) 


non-injective & non-reducible 


field 


Theorem 14.1 11 


V/ 3MC M (M,f) >C mut (M,f) 


reducible 


ring 


Theorem 14. 121 


3/ 37V C cod (/V, /) > CHn(JV, /) > C rou t(/V /) 


reducible 




Theorem |4J4J 


V/ VAA C cod (/V, /) = C^M, f) < s C rout (AA, /) 


linear 


field 


Theoreml5.7l 


Vf3N-CUN,f) = sC mut (Af,f) 


linear 


field 


Theorem [53] 


3f 37V C co d(7V, /) is irrational 


arithmetic sum 




Theorem 16. 31 



Table 1: Summary of our main results for certain classes of target functions. The quantities C co d(7V, /), 
Ciin (TV, /)> and C rou t(7V, /) denote the computing capacity, linear computing capacity, and routing 
computing capacity, respectively, for a network TV with s sources and target function /. The columns 
labeled / and A indicate contraints on the target function / and the source alphabet A, respectively. 
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2 Network model and definitions 

In this paper, a network N = (G, S, p) consists of a finite, directed acyclic multigraph G = (V, S), 
a set S = {(Ti, . . . , a s } C V of s distinct source nodes and a single receiver p E V. We assume 
that p E 1 S, and that the graphQ G contains a directed path from every node in V to the receiver p. 
For each node u E V, let £i n (u) and £ ou t(u) denote the in-edges and out-edges of u respectively. 
We assume (without loss of generality) that if a network node has no in-edges, then it is a source 
node. If e = (u, v) E £, we will use the notation head(e) = u and tail(e) = v. 

An alphabet is a finite set of size at least two. Throughout this paper, A will denote a 
source alphabet and B will denote a receiver alphabet. For any positive integer m, any vec- 
tor x E A"\ and any % E {1,2,..., m}, let x\ denote the z-th component of x. For any index 
set / = {ii,i 2 , . . . ,i q } C {1,2,..., m} with zi < i 2 < . . . < i q , let xi denote the vector 
Xi 2 , . . . , Xj ) G ^4' 7 '. Sometimes we view A as an algebraic structure such as a ring, i.e., with 
multiplication and addition. Throughout this paper, vectors will always be taken to be row vectors. 
Let W q denote a finite field of order q. A superscript t will denote the transpose for vectors and 
matrices. 

2.1 Target functions 

For a given network J\f = (G, S, p), we use s throughout the paper to denote the number \S\ of 
receivers in Af. For given network J\f, a target function is a mapping 

The goal in network computing is to compute / at the receiver p, as a function of the source 
messages. We will assume that all target functions depend on all the network sources (i.e. a target 
function cannot be a constant function of any one of its arguments). Some example target functions 
that will be referenced are listed in Tabled 



Target function / 


Alphabet A 


/ (^l j • ■ • j^s) 


Comments 


identity 


arbitrary 


(xi , . . . , X s ) 


B = A S 


arithmetic sum 


{0,l,...,g-l} 


xi + x 2 H \- x s 


'+' is ordinary integer addition, 
B = {0,1,--- ,s(q-l)} 


mod r sum 


{0,l,...,g-l} 


X\ © x 2 © ••• © x s 


© is mod r addition, B = A 


linear 


any ring 


a\X\ + (22^2 + . . . + a s x s 


arithmetic in the ring, B = A 


maximum 


any ordered set 


max {x\, . . . , x s } 


B = A 



Table 2: Definitions of some target functions. 



Throughout the remainder of the paper, we use "graph" to mean a multigraph, and in the context of network 
computing we use "network" to mean a single-receiver network. 
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Definition 2.1. Let alphabet A be a ring. A target function / : A s — > B is said to be reducible 
if there exists an integer A satisfying A < s, an s x A matrix T with elements in A, and a map 
# : A x — ► i3 such that for all x G A s , 

g{xT) = f(x). (1) 

Reducible target functions are not injective, since, for example, if x and y are distinct elements 
of the null-space of T, then 

f(x)=g(xT)=g(0)=g(yT) = f(y). 
Example 2.2. Suppose the alphabet is A = F 2 and the target function is 

/:F3_>{ 0> 1}, 

where 

/(*) 

Then, by choosing A = 2, 

T 

and 0(2/1, 2/ 2 ) = 2/12/2, we get 

c/(xT) 

Thus the target function / is reducible. 

Example 2.3. The notion of reducibility requires that for a target function / : A s — > B, the 
set A must be a ring. If we impose any ring structure to the domains of the identity, arithmetic 
sum, maximum, and minimum target functions, then these can be shown (via our Example 14.21 and 
Lemma H31) to be non-reducible. 

2.2 Network computing and capacity 

Let k and n be positive integers. Given a network J\f with source set S and alphabet A, a message 
generator is any mapping 

a : S—>A k . 

For each source Oi E S, a(<Ji) is called a message vector and its components 

a{a i ) l ,...,a{a i ) k 



= (xi + x 2 )x 3 . 




= g(xt +x 2 ,x 3 ) 
= (xi + x 2 )x 3 

= /(*). 
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are called message^. 

Definition 2.4. A (k, n) network code in a network J\f consists of the following: 

(i) Encoding functions h^ e \ for every out-edge e G £ ou t{v) of every node v G V — p, of the 
form: 

h {e) : I \\ A n x A k — »• .A n if u is a source node 

yeg£ in (v) / 

/i (e) : JJ A n — -» .A" otherwise. 

ee£in(f) 

(ii) A decoding function ip of the form: 

v>: n ^ n ^^ fe - 

Furthermore, given a (A;, n) network code, every edge e £ £ carries a vector z e of at most n 
alphabet symbols^, which is obtained by evaluating the encoding function on the set of vectors 
carried by the in-edges to the node and the node's message vector if the node is a source. The 
objective of the receiver is to compute the target function / of the source messages, for any arbitrary 
message generator a. More precisely, the receiver constructs a vector of k alphabet symbols, such 
that for each i G {1,2,..., k}, the i-th component of the receiver's computed vector equals the 
value of the desired target function /, applied to the i-th components of the source message vectors, 
for any choice of message generator a. 

Definition 2.5. Suppose in a network J\f, the in-edges of the receiver are e±, e 2 , . . . , e\£ in ( P )\- A 
(k, n) network code is said to compute f in Af if for each j G {1,2, ... ,k}, and for each message 
generator a, the decoding function satisfies 

V> [z ei , ■■■ , z e|£<nW| ) . = / [H(Tx) j , a(a s )^ . (2) 

If there exists a [k, n) code that computes / in J\f, then the rational number k/n is said to be an 
achievable computing rate. 

In the network coding literature, one definition of the coding capacity of a network is the 
supremum of all achievable coding rates [5J. We use an analogous definition for the computing 
capacity. 

Definition 2.6. The computing capacity of a network N with respect to a target function / is 

Cco&{N, f) = sup I — : 3 (k,n) network code that computes / in A/" j . 

2 For simplicity we assume each source has associated with it exactly one message vector, but all of the results in 
this paper can readily be extended to the more general case. 
3 By default, we assume that edges carry exactly n symbols. 
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The notion of linear codes in networks is most often studied with respect to finite fields. Here 
we will sometimes use more general ring structures. 

Definition 2.7. Let alphabet A be a ring. A (k, n) network code in a network J\f is said to be a 
linear network code (over A) if the encoding functions are linear over A. 

Definition 2.8. The linear computing capacity of a network J\f with respect to target function / is 
C]in{N,f) = sup|— : 3 (k, n) linear network code that computes / in A/"j. 

The routing computing capacity C wut (M, f) is defined similarly by restricting the encoding 
functions to routing. We call the quantity C cod (Af, f) — C lin (Af, f) the computing capacity gain of 
using nonlinear coding over linear coding. Similar "gains", such as, C cod (Af, f) — C mut (Af, f) and 
Cii n (AT, /) - C mut (Af, /) are defined. 

Note that Definition 12 . 7 1 alio w s linear codes to have nonlinear decoding functions. In fact, since 
the receiver alphabet B need not have any algebraic structure to it, linear decoding functions would 
not make sense in general. We do, however, examine a special case where B = A and the target 
function is linear, in which case we show that linear codes with linear decoders can be just as good 
as linear codes with nonlinear decoders (Theorem l5.7l) . 

Definition 2.9. A set of edges C C £ in network J\f is said to separate sources cr mi , . . . , a md from 
the receiver p, if for each i E {1,2, ... ,d}, every directed path from a mi to p contains at least one 
edge in C. Define 

Ic = {i ■ C separates Oi from the receiver} . 

The set C is said to be a cut in J\f if it separates at least one source from the receiver (i.e. \Ic\ > 1). 
We denote by A (A/") the collection of all cuts in N . 

Since I c is the number of sources disconnected by C and there are s sources, we have 

\Ic\ < s. (3) 

For network coding with a single receiver node and multiple sources (where the receiver de- 
mands all the source messages), routing is known to be optimal ||23~1 . Let C rout (A0 denote the 
routing capacity of the network J\f, or equivalently the routing computing capacity for comput- 
ing the identity target function. It was observed in [|23l Theorem 4.2] that for any single-receiver 
network J\f, 

C rout (A0= min M. (4) 
ceA(j\f) \I C \ 

The following theorem shows that if the intermediate nodes in a network are restricted to perform 
routing, then in order to compute a target function the receiver is forced to obtain all the source 
messages. This fact motivates the use of coding for computing functions in networks. 
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Theorem 2.10. IfM is a network with target function f, then 

C mut (M, f) = C rout (M). 

Proof. Since any routing code that computes the identity target function can be used to compute 
any target function /, we have 

C Ioat (Mj)>C mut (M). 

Conversely, it is easy to see that every component of every source message must be received by p 
in order to compute /, so 

C IO u t (Mj)<C mut (M). 



Theorem 12.121 below gives a general upper bound on how much larger the computing capacity 
can be relative to the routing computing capacity. It will be shown later, in Theorem 15 .7 [ that for 
linear target functions over finite fields, the bound in Theorem 12 .121 can be tightened by removing 
the logarithm term. 

Lemma 2.11. IfM is network with a target function f : A s — > B, then 

C cod WJ)<£og 2 \A\) rmn |C|. 

CeA(AA) 

Proof. Using [4, Theorem II. 1], one finds the term min-cut(A/", /) defined in [4, Equation (3)] in 
terms of a quantity Ri c j, which in turn is defined in flU Definition 1.5]. Since target functions 
are restricted to not being constant functions of any of their arguments, we have Ri c j > 2, from 
which the result follows. ■ 

Theorem 2.12. IfM is network with a target function f : A s — > B, then 

C C o d (M,f)<s(log 2 \A\)C rout (M,f) 

Proof. 

CcodiM, f) < (log 2 U|) min |C| [from Lemma[2Jl] 

CeA(M) 

< s (log 2 \A\) C [0Ut (M, f) . [from ©, ©, and Theorem[2T0]] 
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3 Linear coding over different ring alphabets 

Whereas the size of a finite field characterizes the field, there are, in general, different rings of the 
same size, so one must address whether the linear computing capacity of a network might depend 
on which ring is chosen for the alphabet. In this section, we illustrate this possibility with a specific 
computing problem. 

Let A = {a , a 1; a 2 , a 3 } and let / : A 2 — > {0, 1, 2} be as defined in Table [3j We consider 



/ 


a 


a% 


d2 


a-3 


a 





1 


1 


2 


d\ 


1 





2 


1 


a 2 


1 


2 





1 


a 3 


2 


1 


1 






Table 3: Definition of the 4-ary map /. 



different rings R of size 4 for A and evaluate the linear computing capacity of the network Ai 
shown in Figure [2] with respect to the target function /. Specifically, we let R be either the ring 
Z4 of integers modulo 4 or the product ring Z 2 x Z 2 of 2-dimensional binary vectors. Denote the 
linear computing capacity here by 

C\i a {N\) R = sup I — : 3 (k, n) i?-linear code that computes / in A/" j . 
The received vector z at p can be viewed as a function of the source vectors generated at a 1 and 
• 

<7X °2 P 



Figure 2: Network A/i has two sources u\ and cr 2 and a receiver p. 



a 2 . For any (k, n) i?-linear code, there exist k x n matrices M a and M 2 such that z can be written 
as 



2(0(0-0 , a(<7 2 )) = a(<7i) M l + a(a 2 ) M 2 . (5) 
Let 771* 1, ■ • • , m ii( . denote the row vectors of Mj, for i G {1, 2}. 

Lemma 3.1. Le? ^4 be the ring Z 4 anJ let f : ^4 2 — > {0, 1, 2} the target function shown in 
Table\3\ where di = i, for each i. If a (k,n) linear code over A computes f in A/i and p receives 
a zero vector, then a{a\) = a(a 2 ) £ {0, 2} k . 
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Proof. If a((Ji) = a(a 2 ) = 0, then p receives a by © and must decode a since /((0, 0)) = 
(from Table©. Thus, p always decodes a upon receiving a 0. But /((xi, x 2 )) = if and only if 
x\ = X2 (from Table©, so whenever p receives a 0, the source messages satisfy a(<7i) = a(a 2 ). 

Now suppose, contrary to the lemma's assertion, that there exist messages a(cri) and a(a 2 ) 
such that z(a(oi) , a(a 2 )) = and a(a\)- ^ {0, 2} for some j G {1, 2, • ■ ■ , k}. Since a(a\)- is 
invertible in Z 4 (it is either 1 or 3), we have from © that 



j = -a(o"i)j 1 mi,, + ^ -"(o-Oj 1 Oi(a 2 ) i m 2 ,i (6) 



i=l 



= y {1) M 1 + y (2) M 2 (7) 
where y^ 1 ' and y( 2 ' are A;-dimensional vectors defined by 

(l) _ j -ai^j 1 tfi^J 



if i = j 



y 



(2) 



-^(o-i)^ 1 a(£r 2 )i- (8) 



Also, define the A;-dimensional vector x by 

[1 ifi=j. 

We have from © that 0) = m^j and from © and © that z^ 1 **, f/ 2 * 1 ) = mij. Thus, in order 
for the code to compute /, we must have f(xj, 0) = f(yf \ Vj 1 )- But f(xj,0) = /(l, 0) = 1 and 

/(yi 1) ,yf) = /(0,-a(ff 1 )7 1 a((T 2 ) i ) 

= /(0, -a^Oj 1 a(«7i)j.) [from a(tn) = a(a 2 )] 

= /(0,-l) 

= /(0,3) [from 3 = -1 in Z 4 ] 

= 2 [from Table [3], 

a contradiction. Thus, cn(eri) G {0,2} fe . ■ 

Theorem 3.2. 77ze network M\ in Figure\2\with alphabet A = {a , a±, a 2 , a^} and target function 
f : A 2 — > {0, 1, 2} shown in Table\3\ satisfies 

c&(M,/) ZaxZa = i. 

(For ^4 = Z4, we identify a{ = i, for each i, and for A = 1* 2 x 7L 2 , we identify each a* with the 
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2-bit binary representation ofi.) 

Proof. Consider a (k,n) Z 2 x Z 2 -linear code that computes /. From ©, we have z(x, 0) = 
whenever xM\ = 0. Since /((0, 0)) 7^ f((xi, 0)) (whenever Xj 7^ 0), it must therefore be the case 
that xMi = only when x = 0, or in other words, the rows of Mi must be independent, so n > k. 
Thus, 

C lin (A/\/f 2XZ2 <l. (10) 

Now suppose that A is the ring Z 2 x Z 2 where, a = (0, 0), a x = (0, 1), a 2 = (1, 0), and a 3 = (1, 1) 
and let © denote the addition over A. For any x £ A 2 , the value /(x), as defined in Tabled is 
seen to be the Hamming distance between x\ and x%. If k = n = 1 and Mi = M 2 = [03] (i.e., 
the lxl identity matrix), then p receives x\ © x 2 from which / can be computed by summing its 
components. Thus, a computing rate of k/n = 1 is achievable. From (TTOl) , it then follows that 

We now prove that Cn n (J\f, f) Zi < 2/3. Let .A denote the ring Z 4 where a; = z for < % < 3. 
For a given n) linear code over .4 that computes /, the n-dimensional vector received by p can 
be written as in ©. Let /C denote the collection of all message vector pairs (a(ai) , a(cr 2 )) such 
that z(o;(cri) , a(a 2 )) = 0. Define the 2k x n matrix 



M 



M 2 



and notice that /C = {y G A 2k : yM = 0}. Then, 



4™ = i^r 

> I {yM :yeA 2k }\ [from y G A 2k yM G A n ] 

> l^L- [from yW, y^ G A 2fc and j,WM = y^M y« - y( 2 ) G /C] 



1 2 A 



\A\ 

2 l 

4 3fc / 2 . [from \A\ = 4] 



> ^ k [from Lemma 134 



Thus, k/n < 2/3, so C Un (M, /) Z * < I- 
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4 Linear network codes for computing target functions 

Theorem 12. 101 showed that if intermediate network nodes use routing, then a network's receiver 
learns all the source messages irrespective of the target function it demands. In Section [47T1 we 
prove a similar result when the intermediate nodes use linear network coding. It is shown that 
whenever a target function is not reducible the linear computing capacity coincides with the routing 
capacity and the receiver must learn all the source messages. We also show that there exists a 
network such that the computing capacity is larger than the routing capacity whenever the target 
function is non-injective. Hence, if the target function is not reducible, such capacity gain must 
be obtained from nonlinear coding. Section B31 shows that linear codes may provide a computing 
capacity gain over routing for reducible target functions and that linear codes may not suffice to 
obtain the full computing capacity gain over routing. 

4.1 Non-reducible target functions 

Verifying whether or not a given target function is reducible may not be easy. We now define a 
class of target functions that are easily shown to not be reducible. 

Definition 4.1. A target function / : A s — > B is said to be semi-injective if there exists x G A s 
such that = {x}. 

Note that injective functions are semi-injective. 

Example 4.2. If / is the arithmetic sum target function, then / is semi-injective (since f(x) = 
implies x — 0) but not injective (since /(0, 1) = /(l, 0) = 1). Other examples of semi-injective 
target functions include the identity, maximum, and minimum functions. 

Lemma 4.3. If alphabet A is a ring, then semi-injective target functions are not reducible. 

Proof. Suppose that a target function / is reducible. Then there exists an integer A satisfying 
A < s, matrix T G A sxX , and map g : A x — > B such that 

g (xT) = f{x) for each x G A s . (11) 

Since A < s, there exists a non-zero d G A s such that dT = 0. Then for each x G A s , 

f(d + x)= g((d + x)T) = g(xT) = f(x) (12) 

so / is not semi-injective. ■ 

Definition 4.4. Let A be a finite field and let M. be a subspace of the vector space A s over the 
scalar field A. Let 

M ± = {y G A s : xy f = for all x G M] 
and let dim(.M) denote the dimension of M. over A. 
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Lemma 4.5. %If A is a finite field and M. is a subspace of vector space A s , then (M.^) 1 - = Ai. 

Lemma [4T61 will be used in Theorem 14.81 The lemma states an alternative characterization of 
reducible target functions when the source alphabet is a finite field and of semi-injective target 
functions when the source alphabet is a group. 

Lemma 4.6. Let M be a network with target function f : A s — > B and alphabet A. 

(i) Let A be a finite field, f is reducible if and only if there exists a non-zero d G A s such that 
for each a G A and each x G A s , 

f(ad + x) = f(x). 

(ii) Let A be a group, f is semi-injective if and only if there exists x G A s such that for every 
non-zero d G A s , 

f(d + x)^f(x). 

(The arithmetic in ad + x and d + x is performed component-wise over the corresponding A.) 

Proof, (i) If / is reducible, then there exists an integer A satisfying A < s, matrix T G *4. sxA , and 
map g : A x — > B such that 

g (xT) = f(x) for each x G A s . (13) 

Since A < s, there exists a non-zero d G A s such that dT = 0. Then for each a G A and each 

x G A s , 

f{ad + x) = g((ad + x)T) = g{xT) = f(x). (14) 

Conversely, suppose that there exists a non-zero d such that (PT41) holds for every a G A and every 
x G A s and let Ai be the one-dimensional subspace of A s spanned by d. Then 

f(t + x) = f(x) for every t G M, x G A s . (15) 

Note that dim(A^ _L ) = s — 1. Let A = s — 1, let T G ^4 sxA be a matrix such that its columns form 
a basis for M^, and let 1Z T denote the row space of T. Define the map 

g:TZ T ^ f(A s ) 

as follows. For any y G TZt such that y = xT for x G A s , let 

g(y)=g(xT)=f(x). (16) 

4 This lemma is a standard result in coding theory regarding dual codes over finite fields, even though the operation 
xy l is not an inner product (e.g. fPH Theorem 7.5] or |[T8l Corollary 3.2.3]). An analogous result for orthogonal 
complements over inner product spaces is well known in linear algebra (e.g. 1T31 Theorem 5 on pg. 286]). 
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Note that if y = x {1) T = x (2) T for ^ x {2 \ then 

(x^-x^)T = 

- x (2) G (M L )^ [from construction of T] 

- x (2) G M [from Lemma|43] 

/ ( a; ( 1 )) = / (( a ;( 1 )-x( 2 ))+x( 2 )) 

= /(x (2) ). [fromGSM 

Thus g is well defined. Then from (fT6l ) and Definition 12 .11 / is reducible. 

(ii) Since / is semi-injective, there exists a x E A s such that {x} = which in 

turn is true if and only if for each non-zero d G A s , we have f(d + x) ^ f(x). 

■ 

The following example shows that if the alphabet A is not a finite field, then the assertion in 
Lemma HToT X) may not be true. 

Example 4.7. Let A = Z 4 , let / : A — > A be the target function defined by f(x) = 2x, and let 
d = 2. Then, for all a £ A, 

f(2a + x) = 2(2a + x) 

= 2x [from 4 = in Z 4 ] 

= m 

but, / is not reducible, since s = 1. 

Theorem |4.8l establishes for a network with a finite field alphabet, whenever the target function 
is not reducible, linear computing capacity is equal to the routing computing capacity, and therefore 
if a linear network code is used, the receiver ends up learning all the source messages even though 
it only demands a function of these messages. 

For network coding (i.e. when / is the identity function), many multi-receiver networks have a 
larger linear capacity than their routing capacity. However, all single-receiver networks are known 
to achieve their coding capacity with routing [23]. For network computing, the next theorem shows 
that with non-reducible target functions there is no advantage to using linear coding over routingjfl 

Theorem 4.8. Let M be a network with target function f : A s — > B and alphabet A. If A is a 
finite field and f is not reducible, or A is a ring with identity and f is semi-injective, then 

C lin {Af, f) = C rout (Af, f). 

Proof. Since any routing code is in particular a linear code, 

Cim(A/\ f)>C mut (Af,f). 

5 As a reminder, "network" here refers to single-receiver networks in the context of computing. 
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Now consider a (k, n) linear code that computes the target function / in N and let C be a cut. 
We will show that for any two collections of source messages, if the messages agree at sources 
not separated from p by C and the vectors agree on edges in C, then there exist two other source 
message collections with different target function values, such that the receiver p cannot distinguish 
this difference. In other words, the receiver cannot properly compute the target function in the 
network. 

For each e E C, there exist k x n matrices M(e)i, . . . , M(e) s such that the vector carried on e 

is 

s 
4=1 

For any matrix M, denote its j-th column by Let w and y be different k x s matrices over 

A, whose j-th columns agree for all j ^ Iq. 

Let us suppose that the vectors carried on the edges of C, when the the column vectors of w are 
the source messages, are the same as when the the column vectors of y are the source messages. 
Then, for all e E C, 

s s 

Y J W {£] M{e) l = Y,y [€] M{e) l . (17) 

i=l 4=1 

We will show that this leads to a contradiction, namely that p cannot compute /. Let m be an 
integer such that if d denotes the m-th row of w — y, then d ^ 0. For the case where A is a field 
and / is not reducible, by Lemma HToT i). there exist a E A and x E A s such that ad ^ and 

f(ad + x)^f(x). (18) 

In the case where A is a ring with identity and / is semi-injective, we obtain (fT8~l) from Lemma HToT ii) 
in the special case of a = 1 . 

Let u be any k x s matrix over A whose m-th row is x and let v — u + a(w — y). From ( fT8l , 
the target function / differs on the m-th rows of u and v. Thus, the vectors on the in-edges of the 
receiver p must differ between two cases: (1) when the sources messages are the columns of u, 
and (2) when the sources messages are the columns of v. The vector carried by any in-edge of the 
receiver is a function of each of the message vectors a(crj), for j ^ I c , and the vectors carried by 
the edges in the cut C. Furthermore, the j-th columns of u and v agree if j ^ Ic- Thus, at least one 
of the vectors on an edge in C must change when the set of source message vectors changes from 
u to v. However this is contradicted by the fact that for all e E C, the vector carried on e when the 
columns of u are the source messages is 

s s s 

u {i) M(e)i = u {i) M(e)i + a - y W ))M(e); [from 

i=l i=l i=l 

s 

= Y,v (l) M(e) l (19) 

4=1 
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which is also the vector carried on e when the columns of v are the source messages. 

Hence, for any two different matrices w and y whose j-th columns agree for all j Ic, at 
least one vector carried by an edge in the cut C has to differ in value in the case where the source 
messages are the columns of w from the case where the source messages are the columns of y. 
This fact implies that 

(|^| n ) |c| > (|^| fe ) |/cl 



and thus 



k | CI 
< 



n \Ic\ 

Since the cut C is arbitrary, we conclude (using ©) that 

-< min ff\ = C t0Ut (N, f) . 
n ceA(A0 \Ic\ 

Taking the supremum over all (fc, n) linear network codes that compute / in J\f, we get 

C^{Mj)<C mut (M,f). 




p 

Figure 3: Network J\f 2jS has sources ai, cr 2 , . . . , cr s , each connected to the relay v by an edge and v 

is connected to the receiver by an edge. 

Theorem 14.81 showed that if a network's target function is not reducible (e.g. semi-injective 
target functions) then there can be no computing capacity gain of using linear coding over routing. 
The following theorem shows that if the target function is injective, then there cannot even be any 
nonlinear computing gain over routing. 
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Note that if the identity target function is used in Theorem 14.91 then the result states that there 
is no coding gain over routing for ordinary network coding. This is consistent since our stated as- 
sumption in Section|2]is that only single-receiver networks are considered here (for some networks 
with two or more receivers, it is well known that linear coding may provide network coding gain 
over network routing). 

Theorem 4.9. If JV is a network with an injective target function f, then 

C cod {Mj) = C rout {Nj). 

Proof. It follows from [|23l Theorem 4.2] that for any single-receiver network N and the identity 
target function /, we have C cod (Af, f) = C Iout (Af, f). This can be straightforwardly extended to 
injective target functions for network computing. ■ 

Theorem 14.81 showed that there cannot be linear computing gain for networks whose target 
functions are not reducible, and Theorem 14.91 showed that the same is true for target functions 
that are injective. However, Theorem 14.1 II will show via an example network that nonlinear codes 
may provide a capacity gain over linear codes if the target function is not injective. This reveals a 
limitation of linear codes compared to nonlinear ones for non-injective target functions that are not 
reducible. For simplicity, in Theorem 14.1 II we only consider the case when there are two or more 
sources. We need the following lemma first. 

Lemma 4.10. The computing capacity of the network J\f 2)S shown in Figure \3\ with respect to a 
target function f : A s — > B, satisfies 

C C0 d{N2,s, f) > min \ 1, — - \ . 

Proof. Suppose 

log w <1. (20) 

Let k — n — 1 and assume that each source node sends its message to node v. Let 

g : f(A s )^A 

be any injective map (which exists by (|2Q|)). Then the node v can compute g and send it to the 
receiver. The receiver can compute the value of / from the value of g and thus a rate of 1 is 
achievable, so C cod (A2, s , /) > 1. 
Now suppose 

Iog w |/(^)| >1. (21) 
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Choose integers k and n such that 



' e < - < , i™. (22) 



log^ -n-\og ]A] \f(A 
Now choose an arbitrary injective map (which exists by (l22l) ) 

g : (/(-4 s )) fc ^-4". 



Since n > k (by (1211) and (|22l) ). we can still assume that each source sends its A;-length message 
vector to node v. Node v computes / for each of the k sets of source messages, encodes those 
values into an n-length vector over A using the injective map g and transmits it to the receiver. The 
existence of a decoding function which satisfies © is then obvious from the fact that g is injective. 
From (|22l) . the above code achieves a computing rate of 

k 1 

n ~ log w \f(A')\~ 

Since e was arbitrary, it follows that the computing capacity C co d(A/2, s , /) is at least 1/ log^ |/ (^4 S )| 

■ 

Theorem 4.11. Let A be a finite field alphabet. Let s > 2 and let f be a target function that is 
neither injective nor reducible. Then there exists a network M such that 

CUM, f)>Ci in (Af, f). 

Proof. If M is the network J\f 2>s shown in Figure [3] with alphabet A, then 

Can (A/", /) = l/s [from TheoremgJ and ©] 

< min jl, J [from * > 2 and [/ (^)| < |^| s ] 

< Ccod(A^, /) . [from LemmagjO] 



The same proof of Theorem l4.1 ll shows that it also holds if the alphabet A is a ring with identity 
and the target function / is semi-injective but not injective. 



4.2 Reducible target functions 

In Theorem 14.121 we prove a converse to Theorem 14.81 by showing that if a target function is 
reducible, then there exists a network in which the linear computing capacity is larger than the 
routing computing capacity. Theorem 14.141 shows that, even if the target function is reducible, 
linear codes may not achieve the full (nonlinear) computing capacity of a network. 
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Theorem 4.12. Let Abe a ring. If a target function f : A s — > B is reducible, then there exists a 
network M such that 

C lin {M, f)>C mut {N, /). 

Proof. Since / is reducible, there exist A < s, a matrix T E A sxX , and a map g : A x — > f(A s ) 
such that 

g(xT) = f{x) for every x E A s . [from Definition [2J] (23) 

Let M denote the network A/2,8 with alphabet A and target function /. Let k = 1, n = X and let 
the decoding function be ip = g. Since n > 1, we assume that all the source nodes transmit their 
messages to node v. For each source vector 

x = (a(<7i) , a(a 2 ) , . . . ,a(a 8 )) 

node v computes xT and sends it to the receiver. Having received the n-dimensional vector xT, 
the receiver computes 

^{xT) = g(xT) [from = g] 

= f(x). [from®] 

Thus there exists a linear code that computes / in Af with an achievable computing rate of 

k _ 1 
n A 

> 1/s [from A < s — 1] 

= C rout (A0 [from©] 

which is sufficient to establish the claim. ■ 

For target functions that are not reducible, any improvement on achievable rate of computing 
using coding must be provided by nonlinear codes (by Theorem 14.81) . However, within the class 
of reducible target functions, it turns out that there are target functions for which linear codes are 
optimal (i.e., capacity achieving) as shown in Theorem 15 .7 [ while for certain other reducible target 
functions, nonlinear codes might provide a strictly larger achievable computing rate compared to 
linear codes. 

Remark 4.13. It is possible for a network J\f to have a reducible target function / but satisfy 
Ciin (A/ - , /) = Crout(A/", /) since the network topology may not allow coding to exploit the structure 
of the target function to obtain a capacity gain. For example, the 3-node network in Figure |4] with 
f{xi,x 2 ) = xi + x 2 and finite field alphabet A has 

CKn{M,f)=C mut {M,f) = l. 
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• • 

(T X P cr 2 

Figure 4: A network where there is no benefit to using linear coding over routing for computing /. 

Theorem 14.1 II shows that for every non-injective, non-reducible target function, some net- 
work has a nonlinear computing gain over linear coding, and Theorem 14.121 shows that for every 
reducible (hence non-injective) target function, some network has a linear computing gain over 
routing. The following theorem shows that for some reducible target function, some network has 
both of these linear and nonlinear computing gains. 

Theorem 4.14. There exists a network M and a reducible target function f such that: 

C cod {M, f) > C lin (AT, f) > C mut (Af, f) . 

Proof. Let N denote the network A/2,3 shown in Figure [3] with s = 3, alphabet A = F 2 , and let / 
be the target function in Example 12.21 The routing capacity is given by 

[from ©] (24) 

Let k = n = 1. Assume that the sources send their respective messages to node v. The target 
function / can then be computed at v and sent to the receiver. Hence, k/n = 1 is an achievable 
computing rate and thus 

Ccod(A/\/)>l. (25) 

Now consider any (k,n) linear code that computes / in J\f. Such a linear code immediately implies 
a (k, n) linear code that computes the target function g(x 1 , x 2 ) = x±x 2 in network A/2,2 as follows. 
From the (k, n) linear code that computes / in M, we get a 3A; x n matrix M such that the node v 
in network Af computes 

(a(cri) a(<r 2 ) a(a 3 )) M 

and the decoding function computes / from the resulting vector. Now, in A/2,2, we let the node v 
compute 

(a(cri) a(a 2 )) M 

and send it to the receiver. The receiver can compute the function g from the received n-dimensional 
vector using the relation g(x\, x 2 ) = f(x±, 0, x 2 ). Using the fact that the function g is not reducible 
(in fact, it is semi-injective), 

- < Clin (A/2,2, 

n 

= Crout (J^2,2 , g) [from Theorem [4T8] 

= 1/2. [from ©] 



C rou t(AT, /) = l/3. 
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Consequently, 

CvnWJ) < 1/2. (26) 
Now we will construct a (1, 2) linear code that computes / in J\f. Let k = 1, n = 2 and 

M = 

Let the sources send their respective messages to v while v computes 

(a(<7i) a(o- 2 ) «(cr 3 )) M 

and transmits the result to the receiver from which / is computable. Since the above code achieves 
a computing rate of 1/2, combined with (|26l) . we get 

C lm (A/\/) = l/2. (27) 

The claim of the theorem now follows from (|24l) . (|25l) . and (f27l) . ■ 
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5 Computing linear target functions 

We have previously shown that for reducible target functions there may be a computing capacity 
gain for using linear codes over routing. In this section, we show that for a special subclass of 
reducible target functions, namely linear target functions^ over finite fields, linear network codes 
achieve the full (nonlinear) computing capacity. We now describe a special class of linear codes 
over finite fields that suffice for computing linear target functions over finite fields at the maximum 
possible rate. 

Throughout this section, let J\f be a network and let k, n, and c be positive integers such that 
k/n = c. Each k symbol message vector generated by a source a G S can be viewed as a c- 
dimensional vector 

a{a) = (a(a) 1 , a(a) 2 a(a) c ) G ¥ qk 

where a(a) i G ¥ qn for each i. Likewise, the decoder tp generates a vector of k symbols from ¥ q , 
which can be viewed as a c-dimensional vector of symbols from ¥ qn . For each e G £, the edge 
vector z e is viewed as an element of ¥ q n. 

For every node u G V — p, and every out-edge e G £ out (u), we choose an encoding function 
whose output is: 

< ee£ m («) j=i (28) 

E 1s^ z e otherwise 

for some7f ) ,/3 J (e) G ¥ qn and we use a decoding function ip whose j-th component output ipj is: 

£ Sfz e for all j G{l,2,...,c} (29) 

ee£i„(p) 

for certain 5^ G ¥ qn . Here we view each as a function of the in-edges to e and the source 
messages generated by u and we view ip as a function of the inputs to the receiver. The chosen 
encoder and decoder are seen to be linear. 

Let us denote the edges in 8 by e±, e 2 , . . • , eigi. For each source cr and each edge ej G £ out (cr), 
let , . . . , Xc 6 ^ be variables, and for each ej G S in (p), let w^" 1 , . . . , w^' be variables. For every 
ej, ej G £ such that head(ei) = tail(ej), let be a variable. Let x, y, w be vectors containing all 
the variables xf 3 \ y£ , and \ respectively. We will use the short hand notation ¥[y] to mean 
the ring of polynomials F[- ■ ■ , y£?\ ■ ■ ■ ] and similarly for F[x, y, w). 

Next, we define matrices A T (x), F(y), and B{w). 

6 The definition of "linear target function" was given in Table[2] 
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(i) For each r G {1, 2, • • • , s}, let A T (x) be a c x \£\ matrix A T (x), given by 



xf ]) if ej G E out {o T) 



(A T (x)) u = r* — (30) 

otherwise 



(ii) Let F(y) be a |£| x |£| matrix, given by 



. if ei, ej G £ and head( e J = to'/f e^J 

= 1 n . (3!) 

otherwise 



(iii) Let B(w) be a c x \£\ matrix, given by 



= C" ^ e£M (32) 
otherwise. 



Consider an (nc, n) linear code of the form in (1281) — (1291) . 

Since the graph G associated with the network is acyclic, we can assume that the edges 
ei, e 2 , • • • are ordered such that the matrix F is strictly upper-triangular, and thus we can apply 
Lemma I5TT1 Let I denote the identity matrix of suitable dimension. 

Lemma 5.1. (Koetter-Medard £74] Lemma 2]) The matrix I — F(y) is invertible over the ring 

Lemma 5.2. (Koetter-Medard ftT4\ Theorem 3]) For s = 1 and for all t G {1, . . . , s}, the decoder 
in (|29| ) satisfies 

i> = a(a 1 )A T (P)(I-ni)r 1 B(5) t . 

Lemma 5.3. (Alon [2 Theorem 1.2]) Let ¥ be an arbitrary field, and let g = g(x±, . . . , x m ) be 
a polynomial in ¥[x\, . . . , x m }. Suppose the degree deg(g) of g is YliLi t*> where each U is a 
nonnegative integer, and suppose the coefficient of \Xa=i %t i ^ n 9 25 nonzero. Then, if Si, . . . , S m 
are subsets of ¥ with \Si\ > t it there are si G Si, s 2 G S 2 , ■ ■ ■ , s m G S m so that 

g(si, ...,s m )^0. 

For each r G {1,2,..., s}, define the c x c matrix 

M T (x, y, w) = A T (x)(I - F(y))~ 1 B(w) t (33) 

where the components of M T (x, y, w) are viewed as lying in ¥ q [x, y, w]. 

Lemma 5.4. If for allr G {1, 2, ... , s}, 

det (M T (x,y,w)) ^ 
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in the ring ¥ q [x,y,w], then there exists an integer n > and vectors /3,j,6 over ¥ qn such that 
for all t G {1,2, ... ,s} the matrix M T (f3, 7, 5) is invertible in the ring of c x c matrices with 
components in ¥ q n. 

Proof. The quantity 

det (f[M T {x,y,w) 

is a nonzero polynomial in ¥ q [x, y, w] and therefore also in ¥ q n [x, y, w] for any n > 1. Therefore, 
we can choose n large enough such that the degree of this polynomial is less than q n . For such an 
n, Lemma [531 implies there exist vectors (3, 7, 5 (whose components correspond to the components 
of the vector variables x, y, w) over ¥ q n such that 

det |jjM T (/3,7,5)j ^0. (34) 

and therefore, for all r G {1, 2, . . . , s} 

det (M r (/3,7,5))^0. 

Thus, each M r (/3, 7, 5) is invertible. 

■ 

The following lemma improves upon the upper bound of Lemma |2.1 1 l in the special case where 
the target function is linear over a finite field. 

Lemma 5.5. If H is network with a linear target function f over a finite field, then 

C cod (AT,f)< min \C\. 
ceA(A0 

Proof. The same argument is used as in the proof of Lemma |2.1 11 except instead of using Rj c j > 
2, we use the fact that Rj c j = \A\ for linear target functions. ■ 

Theorem 5.6. IfAf is a network with a linear target function f over finite field ¥ q , then 

Cun{M,f)= min \C\. 

CeA(AT) 



Proof. We have 



CUMj)<C coA {N,f) 



< min \C\ . [from Lemma [53 

CeA(AA) 

For a lower bound, we will show that there exists an integer n and an (nc, n) linear code that 

computes / with a computing rate of c = min |C|. 

ceA(A0 
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From Lemma [STTl the matrix / — F(y) in invertible over the ring ¥ g [x, y, w] and therefore also 
over F g n [x, y, w]. Since any minimum cut between the source o T and the receiver p has at least c 
edges, it follows from 031 Theorem that det (M T (x, y,w)) ^ for every r G {1, 2, . . . , s}. 
From Lemma IB~4l there exists an integer n > and vectors (3, 7, 5 over ¥ q n such that M T (/3, 7, 5) 
is invertible for every r 6 {1,2,..., s}. Since / is linear, we can write 

f(u\, ...,u 3 ) = a\U\ H h a s u s . 

For each r G {1,2, ... ,s}, let 

i r (/3) = a T (M T ((3, 7, A T ((3). (35) 

If a linear code corresponding to the matrices A T (f3), B(5), and -^(7) is used in network Af, then 
the c-dimensional vector over ¥ q n computed by the receiver p is 

s 

tfj = J2 "K) &r{PW - F (l))~ lB ( s Y [ from Lemma[531and linearity] 

T=l 

S 

= «(^r) «r (M T (/3, 7, A r (/3)(/ - F( 7 ))- 1 J B((5)' [from ©] 

T=l 

S 

= a T a(a r ) [from d33l ] 

r=l 

= (/(a(o-i)i > • • • > a(o-s)i) , • • • , /(a(o-i) c > • • • , a( ff s ) c )) 

which proves that the linear code achieves a computing rate of c. ■ 

Theorem 15 .71 below proves the optimality of linear codes for computing linear target functions 
in a single-receiver network. It also shows that the computing capacity of a network for a given 
target function cannot be larger than the number of network sources times the routing computing 
capacity for the same target function. This bound tightens the general bound given in Theorem |2.12| 
for the special case of linear target functions over finite fields. Theorem 15.81 shows that this upper 
bound can be tight. 

Theorem 5.7. IfM is network with s sources and linear target function f over finite field ¥ q , then 

Cun{M, f) = C cod {N, f) < s C mut {N, f) . 

7 Using the implication (1) ==>■ (3) in lfT4l Theorem 2]. 
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Proof. 

sC mut {Af,f)> min \C\ [from © and Theorem [OOl 

CeA(Af) 

> C cod (Af, f) [from Lemma [53] 

= min [CI. [from Theorem 151)1 

CeA(A/") 



We note that the inequality in Theorem 15.71 can be shown to apply to certain target functions 
other than linear functions over finite fields, such as the minimum, maximum, and arithmetic sum 
target functions. 

Theorem 5.8. For every s, if a target function f : A s — > A is linear over finite field ¥ q , then 
there exists a network M with s sources, such that 

CmiU, f) = sC mut (Af,f). 

Proof. Let J\f denote the network J\f 2jS shown in Figure [3] Then 

Cii n (AT, /) = 1 [from Theorem 153] 

C r0 ut(AT, /) = C rout (A0 [from Theorem|2T0] 

= l/s. [from ©] 
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6 The reverse butterfly network 

In this section we study an example network which illustrates various concepts discussed previ- 
ously in this paper and also provides some interesting additional results for network computing. 



Source o l 




Receiver 1 Receiver 2 p 

(a) The butterfly network (6) The reverse-butterfly network 

Figure 5: The butterfly network and its reverse A3. 

The network A3 shown in Figure (2b) is called the reverse butterfly network. It has S = 
{oi, cr 2 }, receiver node p, and is obtained by reversing the direction of all the edges of the multicast 
butterfly network shown in Figure Oa). 

Theorem 6.1. The routing and linear computing capacities of the reverse butterfly network A3 with 
alphabet A = {0, 1, . . . , q — 1} and arithmetic sum target function f : A 2 — > {0, 1, . . . , 2(q — 1)} 
are 

Crou t {NzJ)=C lin {NzJ) = l. 

Proof. We have 

MA3, /) = CroutCAs) [from Theoremg^] 

= 1. [from ©] 

■ 

Remark 6.2. The arithmetic sum target function can be computed in the reverse butterfly network 
at a computing rate of 1 using only routing (by sending o\ down the left side and cr 2 down the right 
side of the graph). Combined with Theorem [67TL it follows that the routing computing capacity is 
equal to 1 for all q > 2. 
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Theorem 6.3. The computing capacity of the reverse butterfly network A/3 with alphabet A 
{0, 1, . . . , q — 1} and arithmetic sum target function f : A 2 — > {0,1,..., 2(g — 1)} is 

C cod (M 3 ,f) 



log, (2q - i; 



Remark 6.4. The computing capacity C co a(J\fs, f) obtained in Theorem 16.31 is a function of the 
coding alphabet A (i.e. the domain of the target function /). In contrast, for ordinary network 
coding (i.e. when the target function is the identity map), the coding capacity and routing capacity 
are known to be independent of the coding alphabet used [5 ]. For the reverse butterfly network, if, 
for example, q = 2, then C CO d(A/3, /) is approximately equal to 1.26 and increases asymptotically 
to 2 as q — > 00. 

Remark 6.5. The ratio of the coding capacity to the routing capacity for the multicast butterfly 
network with two messages was computed in [[51 to be 4/3 (i.e. coding provides a gain of about 
33%). The corresponding ratio for the reverse butterfly network increases as a function of q from 
approximately 1.26 (i.e. 26%) when q = 2 to 2 (i.e. 100%) when q = 00. Furthermore, in contrast 
to the multicast butterfly network, where the coding capacity is equal to the linear coding capacity, 
in the reverse butterfly network the computing capacity is strictly greater than the linear computing 
capacity. 

Remark 6.6. Recall that capacity is defined as the supremum of a set of rational numbers k/n such 
that a (k, n) code that computes a target function exists. It was pointed out in [[51 that it remains 
an open question whether the coding capacity of a network can be irrational. Our Theorem 16.31 
demonstrates that the computing capacity of a network (e.g. the reverse butterfly network) with 
unit capacity links can be irrational when the target function to be computed is the arithmetic sum 
target function of the source messages. 

The following lemma is used to prove Theorem 16.31 

Lemma 6.7. The computing capacity of the reverse butterfly network A3 with A = {0,1, ... ,q— 1} 
and the mod q sum target function f is 

C cod (Af 3 ,f) = 2. 

Proof. The upper bound of 2 on C CO d(A3, /) follows from [4, Theorem II. 1]. To establish the 
achievability part, let k = 2 and n = 1. Consider the code shown in Figure [6] where '©' indicates 
the mod q sum. The receiver node p gets a(a\) l ©a(cr 2 ) 1 and a(ai) l ®a(a 2 ) l ©a(cxi) 2 ©a(cr 2 ) 2 
on its in-edges, from which it can compute a(a 1 ) 2 © a(a 2 ) 2 . This code achieves a rate of 2. ■ 

Proof of Theorem \6. 3\ We have 

C C od(A/\ /) < 2/ log 9 (2g - 1). [from El Theorem HI]] 

To establish the lower bound, we use the fact the that arithmetic sum of two elements from A = 
{0, 1, . . . , q — 1} is equal to their mod 2q — l sum. Let the reverse butterfly network have alphabet 
A = {0, 1, ... , 2(q — 1)}. From Lemma l6771 (with alphabet A), the mod 2q— 1 sum target function 
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U\ (72 




P 



Figure 6: The reverse butterfly network with a code that computes the mod q sum target function. 

can be computed in J\f at rate 2. Indeed for every n > 1, there exists a (2n, n) network code that 
computes the mod 2q — 1 sum target function at rate 2. So for the remainder of this proof, let 
k = 2n. Furthermore, every such code using A can be "simulated" using A by a corresponding 
(2n, [nlog g (2q — 1)] ) code for computing the mod 2q — 1 sum target function, as follows. Let n' 
be the smallest integer such that q n ' > (2g — l) n , i.e., n' = \n log (2q — 1)] . Let g : A n —> A n ' be 
an injection (which exists since q n> > (2q — l) n ) and let the function g^ 1 denote the inverse of g on 
it's image Let denote the first and last, respectively, halves of the message vector 

a(<Ti) G A 2n , where we view and x^ as lying in A n (since A C A). The corresponding 
vectors y^\y^ 2 > for the source a 2 are similarly defined. 

Figure [7] illustrates a (2n,n') code for network J\f using alphabet A where '©' denotes the 
mod 2q — 1 sum. Each of the nodes in J\f converts each of the received vectors over A into a vector 
over A using the function g~ x , then performs coding in Figure [6] over A, and finally converts the 
result back to A. Similarly, the receiver node T computes the component-wise arithmetic sum of 
the source message vectors a(a 1 ) and a(a 2 ) using 

Qj((Ti) + a(cr 2 ) 

= (g'\g(x w © x( 2 ) © y w © e tr 1 ^ 2 ) © y ( 2 ))), 
^ 1 (^(x (2) ©i/ (2) ))) 

= (x«©t/( 1 ),x( 2 )©y( 2 )). 



Page 29 of [32] 



Appuswamy-Franceschetti-Karamchandani-Zeger 



May 6,2011 




Figure 7: The reverse butterfly network with a code that computes the arithmetic sum target 

function. '©' denotes mod 2q — 1 addition. 

For any n > 1, the above code computes the arithmetic sum target function in N at a rate of 



k 



In 



n 



' ' [7ilog,(2 9 -l)l 



Thus for any e > 0, by choosing n large enough we obtain a code that computes the arithmetic 
sum target function, and which achieves a computing rate of at least 



log, (2g-l) 



e. 
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